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(54) TIlie: DETECTION OF CERVICAL NEOPLASIAS USING FLUORESCENCE SPECTROSCOPY 
(57) Abstract 

A method for detecting tissue abnormality, particularly precancerous cervical tissue, through fluorescence spectroscopy is disclosed, 
/n vitro fluorescence measuiements over a variety of diffiennt fluorescence spectra are used to screen tissue samples. Using a principal 
component analysis (PCA). it is possible to discriminate twtween normal and dysplastic tissues with lelalivcly low false-positive and false- 
negative results. 
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DETECTION OF CERVICAL NEOPLASIAS USING FLUORESCENCE SPECTROSCOPY 



BACKGROmm OF THE IN VENTION 

The field of the invention relates to optical 
methods used for the screening and diagnosis of tissue 
10 abnormalities. In particular, the invention relates to 
the use of fluorescent spectroscopy to detect cancerous 
and precancerous tissues of the cervix. 

Cervical cancer is the second most common malignancy 
15 in women worldwide, exceeded only by breast cancer and in 
the United States, it is the third most common neoplasm 
of the female genital tract - 15,000 new cases of 
invasive cervical cancer and 55,000 cases of carcinoma in 
situ (CIS) were reported in the U.S. in 1994. In 1994, 
20 an estimated 4,600 deaths occurred in the United States 
alone from cervical cancer. However, in recent years, 
the incidence of pre- invasive squamous carcinoma of the 
cervix has risen dramatically, especially among young 
women. Women under the age of 3 5 years account for up to 
25 24.5% of patients with invasive cervical cancer, and the 
incidence is continuing to increase for women in this age 
group. It has been estimated that the mortality of 
cervical cancer may rise by 20% in the next decade unless 
further improvements are made in detection techniques. 

30 

The mortality associated with cervical cancer can be 
reduced if this disease is detected at the early stages 
of development or at the pre-cancerous state (cervical 
intraepithelial neoplasia (CIN) ) , A Pap smear is used to 
35 screen for CIN and cervical cancer in the general female 
population. This technique has a false-negative error 
rate of 15-40%. An abnormal Pap smear is followed by 
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colposcopic examination, biopsy and histologic 
confirmation of the clinical diagnosis. Colposcopy 
requires extensive training and its accuracy for 
diagnosis is variable and limited even in expert hands. 
5 A diagnostic method that could improve the performance of 
colposcopy in the hands of less experienced 
practitioners, eliminate the need for multiple biopsies 
and allow more effective wide scale diagnosis could 
potentially reduce the mortality associated with cervical 
10 cancer. 

Recently, fluorescence, infrared absorption and 
Raman spectroscopies have been proposed for cancer and 
precancer diagnosis. Many groups have successfully 

15 demonstrated their use in various organ systems. Auto- 
and dye -induced fluorescence have shown promise in 
recognizing atherosclerosis and various types of cancers 
and precancers. Many groups have demonstrated that 
autof luorescence may be used for differentiation of 

20 normal and abnormal tissues in the human breast and lung, 
bronchus and gastrointestinal tract. Fluorescence 
spectroscopic techniques have also been investigated for 
improved detection of cervical dysplasia. 

25 Despite these advances, there remains a need for 

diagnostic methods with improved accuracy and ease of 
application that also provide more rapid results. Such 
methods will pemit earlier diagnosis, more effective 
patient management and, potentially, reduce mortality. 

30 

SUMMARY OF THE INVEMTION 

Thus, it is an objective of the present invention to 
provide improved methods for the early detection of 
35 neoplasia. In particular, it is an objective of the 
present invention to provide improved spectroscopic 
methods for the identification of abnormal cervical 
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tissue, thereby providing a rapid, accurate and simple 
method for detecting cancerous or precancerous cervical 
tissue. 

5 In satisfying these and other objectives, there is 

provided a method for the optical diagnosis of tissue 
abnormalities. In one embodiment, the present invention 
provides for the detection of tissue abnormality in a 
tissue sample in vitro by illuminating a tissue sample 

10 with a series of electromagnetic radiation wavelengths 
selected to cause the tissue sample to produce a series 
of fluorescence intensity spectra indicative of tissue 
abnormality. The fluorescence intensity spectra emitted 
from the tissue sample as a result of illumination with 

15 the electromagnetic radiation are detected. Then, a 

probability that the tissue sample is normal or abnormal 
is calculated from the fluorescence intensity spectra. 

The invention further contemplates that the 
20 calculations include principal component analysis of the 
spectra, relative to a plurality of preprocessed spectra 
obtained from tissue samples of known diagnosis. The 
invention also contemplates normalizing the spectra, 
relative to a maximum intensity within the spectra, and 
25 mean- scaling the spectra as a function of a mean 
intensity of the spectra. 

The apparatus of the present invention includes a 
controllable illumination device for emitting a plurality 

30 of electromagnetic radiation wavelengths selected to 

cause a tissue sample to produce a fluorescence intensity 
spectrum indicative of tissue abnormality, an optical 
system for applying the plurality of radiation 
wavelengths to a tissue sample, a fluorescence intensity 

3 5 spectrum detecting device for detecting an intensity of 

fluorescence spectra emitted by the £?ample as a result of 
illumination by the plurality of electromagnetic 
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radiation wavelengths, a data processor, connected to the 
detecting device, for analyzing detected fluorescence 
spectra to calculate a probability that the sample is 
Gibnonnal . 

5 

These and other features and advantages of the 
present invention will become apparent to those of 
ordinary skill in this art with reference to the appended 
drawings and following detailed description. 

10 

BRIEF PgSgRTPTIOK OF m DRAWTWQS 

FIG. 1 is a graph of the Xex/Xem values for known 
cell and tissue flurophores. Also indicated are known 
15 cell and tissue f luorophores : first and second harmonic 
Rayleigh Scattering (i) ; HjO Raman Scattering (2) 
PN (NADH) (3); Fp (FADH,) (4); tryptophan (5); porphyrins 
(6); collagen (7); elastin (8); Hb absorption bands (9). 

2° FIG. 2 is a graph of the average cell pellet EEM. 

FIG. 3 is a graph of a sample cell pellet EEM with a 
high (430,520) peak. 

FIG. 4 is a graph of a sample cell pellet EEM with a 
high (250,4 00) peak. 

FIG. 5 is a scattergram of cell pellet 
classification based on principal component analysis of 
10 EEM's (spectral data only). (x) indicates samples deemed 
abnormal by Pap smear reading and (o) indicates samples 
deemed normal by Pap smear reading; 44 total = 15 normal 
and 29 abnormal. 

'5 FIG. 6 is a graph showing classification errors for 

Fisher's discriminant analysis (• - cross validation 
error estimates; ■ - Fisher's method errors). 
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FIG- 7 is a graph of typical ThinPrep versus Pellet 

Spectra for 280 nm excitation wavelength ( is thin 

prep slide; — is cell pellet) . 

5 FIG. 8 is a graph of typical ThinPrep versus Pellet 

Spectra for 370 nm excitation wavelength ( is thin 

prep slide; — is cell pellet) . 

FIG. 9 is a graph of a hypothetical distribution of 
10 test values for hypothetical samples (1 is specificity; 2 
is sensitivity) . 



15 I . Zntroduc t ion 

Clinical detection of neoplasias can be divided into 
two different kinds of analysis. First, screening 
provides a way to identify suspicious samples taken from 
a rather large pool of subjects. Subjects may be from 

20 the population as a whole or they may be part of a group 
identified as having a higher than average risk for one 
or more cancers. It is desirable that, because of the 
sheer number of tests, screening assays be relatively 
rapid, easy to conduct and inexpensive. It also is 

25 desirable that they exhibit a low false-negative rate. 

Once patients have been screened, it is necessary to 
proceed with more detailed testing that can be referred 
to generically as diagnosis. In diagnosis, the 

30 neoplasias nature of the sample is confirmed and, in 

addition, further information on the type and degree of 
dysplasia is obtained. This provides the clinician with 
an understanding of the disease state necessary to begin 
a treatment regimen. For diagnosis, cost, ease of 

35 application and rapidity are less important, though 
always desirable. It is important, however, that 
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diagnostic procedures be accurate with respect to the 
kind of cancer identified and its clinical stage. 

The present invention is an example of the first 
5 kind of detection, screening. Present screening methods, 
like Pap smears, are time intensive, require highly 
trained individuals and are relatively expensive. Even 
so, the subjective nature of the scoring often results in 
an unacceptable number of false negatives, the outcome of 
10 which can be devastating. It is believed that by using a 
more objective standard like fluorescent emissions, the 
accuracy of the screen can be improved. In addition, the 
possibility for automation has further benefits in terms 
of time and expense. 

15 

A* Method for Data naininQ PluoreBcent Spaetra 
The present invention is premised on the hypothesis 
that normal and abnormal cells will emit differing 
fluorescent spectra in response to stimulating 

20 electromagnetic radiation. In its most general form, the 
methods comprises providing a tissue sample, illuminating 
that tissue sample with electromagnetic radiation, 
detecting fluorescence of the sample and comparing the 
fluorescence of the sample with that of some standard. 

25 Each of these steps is described in greater detail, 
below. 



Obtaining a tissue sample can be achieved by any one 
of a variety of different means, largely depending on the 
nature of the sample to examined. For example, for 
examination of solid tissues, samples can be taken by 
biopsy. Alternatively, scrapings of cells can be taken 
from the tissue of interest. For examination of cells 
that are not part of solid tissue, liquid samples may be 
obtained and the cells isolated therefrom. For example, 
blood samples may be obtained by any normal methodology. 
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Aspiration of fluids also is contemplated, such as thin 
tissue aspirates. 



Once obtained, it may be necessary to further 
5 process the samples before they are examined. Further 
processing may include various forms of physical 
arrangement of the samples. For example, with solid 
tissues, it may be necessary to prepare thin sections. 
It also may be desired to dissociate the cells from each 

10 other and disperse them as a thin film monolayer. 

Dissociation may be accomplished by physical or enzymatic 
means. Similarly, dissociated cells in fluid samples or 
in scrapings may be concentrated and dispersed in a 
monolayer. In other instances, it may be desirable to 

15 concentrate disperse cells as a pellet. This can be 
accomplished by centrifugation of the liquid samples. 

Further pre- illumination processing includes 
chemical treatments such as fixation steps. In some 

20 cases, it will be desirable that the natural 

autof luorescence of the sample be unaffected. In such a 
case, the chemical treatment is selected so that the 
fluorescent species are not altered. In other cases, it 
may prove useful to use treatments that cause different 

25 autof luorescent profiles. Exemplary treatments include 
alcohol fixation. Suitable alcohols include methanol, 
ethanol, propanol, isopropanol, n-butanol and t-butanol. 

Typically, the samples are provided on a surface, 
3 0 though they can be provided in an open or closed 

container. A typical surface is a glass or quartz 
microscope slide. With certain surfaces, such as glass 
slides, there may be variation from item to item, 
requiring internal recalibration with each sample. There 
35 also may be distorting effects, especially for container- 
enclosed samples, that must be taken into account. 
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Once the samples are prepared, the illumination is 
effected. In the present invention, a variety of 
different wavelengths can be used spanning from about 200 
nm to over 700 nm. Under most circumstances, a plurality 
5 of different wavelengths will be applied, individually, 
to a single sample. Generally, the greater the number of 
wavelengths used, the better the ability to discriminate 
between physiologically distinct tissue samples- Of 
course, at some point, the intervals between wavelengths 
10 will be so small that the information achieved will 

become redundant. Those of skill in the art, knowing the 
fluorescent behavior of biological molecules, will be 
able to select both the appropriate number and range of 
wavelengths for a given purpose. 

15 

In one embodiment, wavelengths from 250 nm to 550 nm 
were applied, with 10 nm intervals. Thus, 31 different 
wavelengths were applied to a single sample in sequence. 
Because a series of different illuminations and emissions 
20 are required, it is important that the sample not be 
affected by the illuminating wavelengths. One such 
effect would be photobleaching, which has been shown to 
be significant in arterial tissue above excitation 
fluences of 80 mJ/mm*. 

25 

The next step in the method is detection. For each 
illuminating wavelength, an emission spectrum is 
determined over a range of wavelengths. Again, a series 
of wavelengths may be used with the greater number 

30 examined, the more information with which to detect 

differences between normal and abnormal tissues. The 
emission spectra are normalized to an appropriate 
Xex/Xem. The excitation-emission matrices (EEM's) may be 
plotted three dimensionally , with excitation wavelength, 

35 emission wavelength and log (1/1 (Xex/Xem) ) as the three 
axes. FIG, 2, FIG. 3, and FIG. 4 depict various EEM's. 
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In another embodiment, emissions were monitored from 
250 nm to 700 nm at 10 nm intervals. Thus, the spectra 
comprises 46 different emission readings. These readings 
were normalized to an Xex/Xem of (270,330) . Based on the 
5 results discussed below, it appears that individual 

wavelengths do not adequately discriminate between normal 
and abnormal tissues. Taken as a group, however, there 
appears to be a correlation between fluorescence values 
and pathology. In order to maximize this correlation, 
10 various statistical manipulations were applied to the 
data, as discussed in detail below, 

FIG. 1 is a graph showing the excitation/emission 
profiles for various known cell and tissue f luorophores . 
15 Those of skill in the art are aware of other potential 
natural f luorophores whose fluorescence may be used to 
generate emission spectra which may then be used in 
accordance with the present invention. 

20 B. Multi-Varlate Statistical Method Davalopment 

In order to maximize the correlation between 
fluorescence values and the physiologic state of the 
sample tissue, multi-variate statistics were applied. 
The five primary steps involved in the multivariate 

25 statistical method are 1) preprocessing of spectral data 
from each patient to account for interpatient variation, 
2) dimension reduction of the preprocessed spectra in the 
calibration set using principal component analysis, 3) 
selection of the diagnostically most useful principal 

30 components using a two-sided unpaired t-test and other 

criteria and 4) development of an optimal classification 
scheme based on Fisher's discriminant analysis using the 
diagnostically useful principal component scores of the 
calibration set as inputs with cross-validation. These 

35 five individual steps of the multivariate statistical 
method are presented below in more detail . 
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1) Preprocaaslng: The objective of preprocessing is to 
calibrate tissue spectra for inter-patient variation 
which might obscure differences in the spectra of 
different tissue types. A normalization method of 
5 preprocessing was invoked on the spectral data. 



Spectra were normalized by dividing the fluorescence 
intensity at each emission wavelength by the fluorescence 
intensity at (280 nm, 330 nm) of that sample. 
10 Normalizing a fluorescence spectrum removes absolute 

intensity information; methods developed from normalized 
fluorescence spectra rely on differences in spectral line 
shape information for diagnosis. 



15 2) Principal Conponant Analysis: Principal component 
analysis (PCA) is a linear model which transforms the 
original variables of a fluorescence emission spectrum 
into a smaller set of linear combinations of the original 
variables called principal components that account for 

20 most of the variance of the original data set. Principal 
component analysis is described in Dillon W.R., Goldstein 
M., Multivariate Analysis: Methods and Applications, John 
Wiley and Sons, 1984, pp. 23-52, the disclosure of which 
is expressly incorporated herein by reference. While PCA 

25 may not provide direct insight to the morphologic and 

biochemical basis of tissue spectra, it provides a novel 
approach of condensing all the spectral information into 
a few manageable components, with minimal information 
loss. Furthermore, each principal component can be 

30 easily related to the original emission spectrum, thus 
providing insight into diagnostically useful emission 
variables . 



Prior to PCA, a data matrix is created where each 
35 row of the matrix contains the concatenated preprocessed 
fluorescence spectra of a sample and each column contains 
the pre-procet^sed fluorescence intensity at each 
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excitation-emission wavelength pair. The data matrix D 
(r X c) , consisting of r rows {corresponding to r total 
samples from all patients in the training set) and c 
columns (corresponding to intensity at c emission- 
5 excitation wavelength pairs) can be written as: 



D = 



( ^11 Di2- • -Die 1 

^21 ^22 ' • • ^2c 



(1) 



10 



The first step in PCA is to calculate the covariance 
matrix, Z. First, each column of the preprocessed data 
15 matrix D is mean-scaled. The mean-scaled preprocessed 
data matrix, D„ is then multiplied by its transpose and 
each element of the resulting square matrix is divided by 
(r-1) , where r is the total number of samples. The 
equation for calculating 2 is defined as: 

20 

Z = ^ (D„/ DJ (2) 



The square covariance matrix, Z (c x c) is decomposed 
into its respective eigenvalues and eigenvectors. 
Because of experimental error, the total number of 
25 eigenvalues will always equal the total number of columns 
(c) in the data matrix D assuming that c < r. The goal 
is to select n < c eigenvalues that can describe most of 
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the variance of the original data matrix to within 
experimental error. The variance, V, accounted for by 
the first n eigenvalues can be calculated as follows: 



V = 100 



/ n N 
c 



(3) 



10 



The criterion used in this analysis was to retain the 
first n eigenvalues and corresponding eigenvectors that 
account for 99.9 % of the variance in the original data 
set . 

Next, the principal component score matrix can be 
calculated according to the following equation: 



R = D C 



(4) 



15 where, D (r x c) is the preprocessed data matrix and C (c 
X n) is a matrix whose columns contain the n eigenvectors 
which correspond to the first n eigenvalues. Each row of 
the score matrix R (r x c) corresponds to the principal 
component scores of a sample and each column corresponds 

20 to a principal component. The principal components are 
mutually orthogonal to each other. 



Finally, the component loading is calculated for 
each principal component. The component loading 
25 represents the correlation between the principal 

component and the variables of the original fluorescence 
emission spectrum. The component loading can be 
calculated as shown below: 
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where, CLi^ represents the correlation between the ith 
variable (preprocessed intensity at ith emission 
wavelength) and the jth principal component. C^^ is the 
5 ith component of the jth eigenvector, Xj is the jth 

eigenvalue and is the variance of the ith variable. 

Principal component analysis was performed on each 
type of preprocessed data matrix, described above. 
10 Eigenvalues accounting for 99.9% of the variance in the 
original preprocessed data set were retained The 
corresponding eigenvectors were then multiplied by the 
original data matrix to obtain the principal component 
score matrix R. 

15 

3) Student's T-Test: Average values of principal 
component scores were calculated for each histopathologic 
tissue category for each principal component obtained 
from the preprocessed data matrix. A two-sided unpaired 

20 student's t-test was employed to determine the diagnostic 
contribution of each principal component. Such a test is 
disclosed in Devore J.L., Probability and Statistics for 
Engineering and the Sciences. Brooks /Cole, 1992, and in 
Walpole R.E., Myers R.H., Probability and Statistics for 

25 Engineers and Scientists, Macmillan Publishing Co., 1978, 
Chapter 7, the disclosures of which are expressly 
incorporated herein by reference. The hypothesis that 
the means of the principal component scores of two tissue 
categories are different were tested for normal smears 

30 and abnormal smears (ASCUS, LGSIL, HGSIL) . Principal 
components were ranked in order of increasing p value. 
Fisher's discriminant analysis was performed using the 
most significant principal components and method 
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performance was evaluated. Principal components were 
then added one at a time and Fisher's discriminant 
analysis was again performed. This process was repeated 
until no further improvement was reached or a decrease in 
5 performance was noted. Principal components chosen in 
this manner were used in the diagnostic method. 



4) Logistic Discrimination: Logistic discriminant 
analysis is a statistical technique that may be used to 

10 develop diagnostic methods based on posterior 

probabilities, overcoming the drawback of the binary 
decision scheme employed in the two-stage method. This 
statistical classification method is based on Bayes 
theorem and may be used to calculate the posterior 

15 probability that an unknown sample belongs to each of the 
possible tissue categories identified. Logistic 
discrimination is discussed in Albert A., Harris E.K., 
Multivariate Interpretation of Clinical Laboratory Data, 
Marcel Dekker, 1987, the disclosure of which is expressly 

20 incorporated herein by reference. Classifying the 

unknown sample into the tissue category for which its 
posterior probability is highest results in a 
classification scheme that minimizes the rate of 
misclassif ication . 

25 

For two diagnostic categories, and Gj, the 
posterior probability of being a member of G^, given 
measurement x, according to Bayes theorem is: 



p(G IX) = P(x|G )P(G,)C(2|1) 

^ P(xiG,)P(G,)C(2|l) +P(x|G2)P(G2)C(ll2) 

30 

where P{x|Gi) is the conditional joint probability that a 
tissue sample of type i will have principal component 
score X, and PCGJ is the prior probability of finding 
tissue type i in the sample population. C(j|i) is the 
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cost of misclassifying a sample into group j when the 
actual membership is group i. 

The prior probability P(Gi) is an estimate of the 
5 likelihood that a sample of type i belongs to a 
particular group when no infomtation about it is 
available. If the sample is considered representative of 
the population, the observed proportions of cases in each 
group can serve as estimates of the prior probabilities. 

10 In a clinical setting, either historical incidence 

figures appropriate for the patient population can be 
used to generate prior probabilities, or the 
practitioner's colposcopic assessment of the likelihood 
of precancer can be used to estimate prior probabilities, 

15 as is known in the art. 

The conditional probabilities may be developed from 
the probability distributions of the n principal 
component scores for each tissue type, i. The 

20 probability distributions may be modeled using the gamma 
function, which is characterized by two parameters, alpha 
and beta, which are related to the mean and standard 
deviation of the data set. The normal function is 
typically used to model probability distributions and is 

25 defined below: 

f{x) - (l/(V27r)E)e-<''-/*^^»=» (7) 

The normal distribution function may be used to 
30 calculate the conditional probability that a sample from 
tissue type i, will exhibit the principal component 
score, X. If more than one principal component is needed 
to describe a sample population, then the conditional 
joint probability is simply the product of the 
35 conditional probabilities of each principal component 

(assuming that each principal component is an independent 
variable) for that sample population. 
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Fisher's discriminant analysis is a particular 
statistical technique for classifying individuals or 
objects in to mutually exclusive and exhaustive groups on 
the basis of a set of independent variables. In this 
5 particular application of Fisher's method, the objects 
are N patient cytological samples, the groups are the 
diagrnostic classifications (normal versus abnormal) and 
the P variables are the principal components X derived 
from the fluorescence spectra of the samples. 

10 

Fisher's method calculates a score Y for each of the 
samples, based on a linear combination of the variables, 
i.e. , 

15 Y(N) = b/X/{N) + b/X/(N) + ... + bp*Xp(N) 

The coefficients bj through bp are calculated so that the 
difference between the scores for the normals and the 
abnormals is maximized. Assuming that the X is normally 
20 distributed for the two groups, and assuming that the 
covariance s of X is the same for the two groups, then 
the best choice for b^ is 

bl = s'^* (avg. of x^ of norm. - avg. of x^ for abnorm.) 

25 

and similarly for bj through bp. Then, a cutoff value for 
Y is selected and all samples with scores above the 
threshold are classified as belonging to the first group, 
normals, and samples with scores below the threshold are 

30 classified as belonging to the second group, abnormals. 
Since there is overlap in the distributions of Y for the 
two groups, some samples will be misclassif ied no matter 
where the cutoff is chosen. The cutoff is chosen to be 
the one that results in the lowest misclassif ication 

35 rate. The cutoff value given the above assumptions is 

Yc = (n^Y, + n^Yj) / (n^ + nj 
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where is the number of samples in group 1, and is 
the Y score using the average values of the X variables 
for group 1, likewise for group 2. Y^ can be adjusted 
from this value to reduce the FN rate at the expense of 
5 the FP rate, or vice versa, depending on the application. 

Since both the b and Yc values are calculated from 
the data, it may be asked how well this method will 
classify new samples, whose values for X were not used in 

10 the above -calculat ions . This performance can be 

estimated by using cross-validation techniques. For each 
sample, b and Yc are calculated using the other sample 
data, and then the method is used to classify that 
sample. The misclassif ication error rate for all samples 

15 is measured this way is taken as an unbiased estimate of 
what one can expect when using Fisher's discriminate 
analysis to classify new samples. Dillon and Goldstein 
(1985) . 

20 II. Exanqples 

In order to evaluate the neoplastic diagnostic 
potential of cellular autof luorescence, excitation of 
exfoliated, ethanol- fixed cervical squamous epithelial 
cells with a plurality of wavelengths was performed. In 
25 addition, the effects on the fluorescent spectra of 
different specimen preparation methods were examined 
using both cell pellet and monolayer preparations made 
from the same samples . 

30 A. g«fnpl<» Preparation 

Exfoliated cervical cells were obtained from 
patients referred to MD Anderson Cancer Center for 
routine screening, as well as colposcopy, on the basis of 
previous abnormal cervical cytology. Conventional Pap 
3 5 smears were obtained and the normally discarded cells 
remaining on the swab suspended in an ethanol -based 
fixative (PreservCyt Solution, Cytyc Corp., Marlborough, 
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MA) . From each suspension, two types of samples were 
prepared. First, a monolayer cell touch prep was 
prepared onto an ordinary glass microscope slide using 
the CYTYC Thinprep device (Hutchinson, 1992) . This 
5 device extracts an aliquot from the suspension and 

filters it to remove red blood cells and other debris 
smaller and larger than epithelial cells. The cells were 
deposited in an circular area 20mm in diameter. 

10 The remaining cells in suspension were centrifuged 

and resuspended three times in HPLC-grade ethanol to 
remove the fluorescent preservative solution. The number 
of cells in suspension was determined using a 
hemocytometer and found to vary between 1 - 50 x 10* 

15 cells. The cells were then spun down to a pellet, placed 
on a spot approximately 3 mm in diameter on a quartz 
microscope slide and air dried. 

B. Fluorascant Spectroscopy Apparatus and 

20 Conditions 

All fluorescence measurements were made using a 
standard scanning fluorimeter (SPEX, Fluorolog II, 
Edison, NJ) with a spectral resolution of 5 nm FWHM. 
Beam area was approximately 2 mm=. Slides were placed so 

25 that the cells were on the side facing the beam, and the 
beam was focused onto this surface of the slide. 
Excitation light was incident normally and emission was 
collected at an angle of 20 degrees from the normal. The 
signal at each (Xex,Xem) value was integrated for 2 

30 seconds. Data were corrected for the non-uniform 

spectral response of the emission monochromator and 
detector using correction factors supplied with the 
instrument. Also, spectra were corrected for variability 
with wavelength in the intensity of the excitation source 

35 using a Rhodamine B quantum counter (20) . 
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Fluorescence excitation-emission matrices (EEM's) 
were recorded for each cell pellet sample. Excitation 
wavelengths ranged from 250 to 550 nm, in 10 nm 
increments, and emission was measured from 10 nm above 
5 the excitation wavelength to 10 nm below the 2nd harmonic 
of the excitation wavelength, up to 700 nm, in 10 nm 
increments. A background EEM was recorded from a quartz 
slide containing supernatant only from the resuspension 
of one sample. This background EEM was subtracted from 
10 each pellet EEM. For several samples, a second emission 
spectrum was recorded at 250 nm excitation after the 
initial EEM to check for photobleaching effects. All 
EEMs were normalized with respect to the intensity at 
(270,330) . 

15 

For the ThinPrep samples, one excitation spectrum 
from 350 nm to 410 nm at 440 nm emission was recorded, 
and two emission spectra were recorded, at excitations of 
280nm and 370nm, using the same increments and spectral 
20 ranges as the EEMs above. Since the UV fluorescence of 
the glass slides differed between slides, a background 
spectrum was recorded from each slide from a region with 
no cells. All spectra were normalized with respect to 
the intensity at (280,330). 

25 

C. Pap Smears 

Conventional Pap smears were read by staff 
cytopathologists at M.D. Anderson Cancer Center. 
Diagnosis was done using the standard diagnostic 
30 classifications, where the samples are classified as 
Normal, HPV, CIN I (mild dysplasia), CIN II (moderate 
dysplasia) , or CIN III (severe dysplasia or carcinoma in 
situ) , and Squamous Carcinoma (Vooijs, 1991) , 

35 Samples were obtained from 80 patients, of which 3 6 

were not used due to inconclusive diagnoses, samples too 
dilute, contamination with blood or other foreign matter 
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such as cotton from swabs used in the sample collection, 
or errors made in the data collection. Of the remaining 
44 samples, 15 had conventional Pap smears which were 
diagnosed as normal (negative for malignant cells) and 29 

5 were read as abnormal (HPV - 14; CINI - 9; CINII - 10; 
CINIII - 10) . The ages ranged from 20 to 52, with the 
average age of 33. There were 21 whites, 7 hispanics, 15 
blacks, and 1 oriental. For comparison of the pellet to 
the ThinPrep preparations, 23 samples were selected from 

0 the patients mentioned above, of which 10 were normal and 
13 abnormal. 



Fixed Ca ll Pluoraaemeii 

FIG. 2 is a graph of the average EEM for all 44 
pellet prepared samples used. All plots are normalized 
to the (270,330) values and plotted on a log scale. 
Contour plot lines are spaced evenly also on a log scale. 
The most intense fluorescence peak is at (280,330), 
characteristic of tryptophan. A peak also is present at 
(370,450), and a slight shoulder at (280,450), both 
characteristic of FN. A shoulder is also present at 
(250,400), due to contaminants of unknown origin. There 
is a slight shoulder near (430,520) due to flavoprotein 
as well as a valley in the excitation spectra 
corresponding to the Soret Hb absorption line at \ex = 
415 nm. No significant photobleaching of the cells was 
observed. 



Although the tryptophan and FN peaks were present in 
all samples, the intensity of the FN and Fp peaks and 
other features varied greatly among the pellet sample 
EEM's. FIG. 3 shows an EEM of a sample with the 
(430,520) peak more pronounced. The Soret absorption 
band is also clearly visible, while the (250,400) peak is 
absent completely. For the sample EEM in FIG. 4, the 
peak at (250,400) is even more intense than the (280,330) 
tryptophan peak. 
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Table 1 lists these spectral features, and the 
average and standard deviation for each. All features 
showed considerable variation among samples, with the 
magnitude of the standard deviation exceeding the mean in 
5 most cases. The (280,330) peak stability can be 

attributed to the fact that the spectra were normalized 
to the nearby (270,330) value. The EEM value with the 
least percent variance was at (300,360), between the 
tryptophan and PN peaks. 

10 

TABLE 1: SPECTRAL FEATURES OF CELL PELLET EEM's* 



feature 

{AmMt) 


all samples 


nor. sample 


abn. samples 


nor V abn 


M 




/J 


a 


A/ 


a 


P 


(280,330) 


1.18 


0.16 


1.16 


0.16 


1.19 


0.17 


0.56 


(250,400) 


0.29 


0.61 


0.33 


0.62 


0.28 


0.62 


0.78 


(280,450) 


0.08 


0.09 


0.08 


0.09 


0.08 


0.09 


0.93 


(370,450) 


0.23 


0.31 


0.21 


0.29 


0.24 


0.33 


0.75 


(430,520) 


0.11 


0.13 


0.09 


0.10 


0.12 


0.15 


0.45 


(280,450)/ 
(370,450) 


0.36 


0.21 


0.40 


0.32 


0.34 


0.28 


0.08 


(300,360) 


0.44 


0.13 


0.39 


0.11 


0.46 


0.13 


0.04 



* - all values normalized to intensity at (270,330) 

25 

Table 1 also compares population statistics for the 
normal and abnormal sample groups. For each feature, the 
variance within each group exceeded the difference 
between the two groups, as reflected in the high p- 
30 values. The lowest p-scores for all values or ratios of 
values in the EEMs are at the ratio of 
(280,450) / (370,450) and at (300,360). 



In order to test the hypothesis that a combination 
35 of several variables may identify significant differences 
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between normal and abnormal cells, principal component 
analysis (PCA) was used to generate a reduced set of 
variables which are linear combinations of the EEM 
values- From each EEM, 25 principal components (PC's) 
5 accounting for 99.99% of the variance between all samples 
were calculated. None of the PC's had a higher 
statistical significance than any of the features in 
Table 1. Table 2 provides this comparison. 

10 



TABLE 2 





%VAR 


mean-nor 


mean-ab 


sidov-nor 


sidev-ab 


p-score 


•18 


0.0193 


0.0207 


0.0107 


0.0507 


0.0324 


0.041305 


• 5 


1.4136 


-0.1318 


0.0682 


0.3042 


0.3685 


0.063457 


• 3 


4.3008 


-0.2137 


0.1105 


0.5753 


0.6279 


0.096108 


22 


0.0047 


0.0050 


-0.0026 


0.0144 


0.0229 


0.187096 


•16 


0.0364 


0.0150 


•0.0077 


0.0491 


0.0605 


0.188685 


23 


0.0036 


0.0042 


-0.0022 


0.0169 


0.0184 


0.25996 


•10 


0.1573 


0.0284 


•0.0147 


0.1183 


0.1191 


0.263165 


19 


0.0106 


0.0074 


-0.0038 


0.0338 


0.0292 


0.28409 


13 


0.0627 


0.0146 


-0.0076 


0.0651 


0.0801 


0.329235 


25 


0.0023 


0.0032 


-0.0016 


0.0181 


0.0120 


0.363534 


14 


0.0463 


0.0122 


•0.0063 


0.0663 


0.0641 


0.382071 


12 


0.0801 


•0.0139 


0.0072 


0.0946 


0.0806 


0.469852 


IS 


0.0442 


0.0084 


•0.0043 


0.0694 


0.0605 


0.553599 


8 


0.3687 


•0.0211 


0.0109 


0.1569 


0.1962 


0.561494 


7 


0.5198 


0.0243 


•0.0126 


0.2173 


0.2192 


0.599062 


1 


66.9477 


•0.2033 


0.1052 


2.1941 


2.6173 


0.682091 


21 


0.0062 


0.0019 


•0.0010 


0.0226 


0.0246 


0.695786 


17 


0.0224 


0.0039 


-0.0020 


0.0534 


0.0408 


0.707169 


9 


0.2596 


-0.0104 


0.G054 


0.1533 


0.1555 


0.75051 
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pn# 


%VAR 


mean-nor 


mean-ab 


sidev-nor 


sidev-ab 


p-seore 


20 


0.0079 


-0.0013 


0.0007 


0.0233 


0.0288 


0.800828 


24 


0.0031 


-0.0008 


0.0003 


0.0196 


0.0156 


0.876723 


2 


22.881 


0.0348 


0.0180 


1.4811 


1.4414 


0.910808 


4 


1.9629 


0.0080 


•0.0042 


0.4774 


0.3980 


0.933139 
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1 pc# 


%VAR 


inun-Ror 


mMiMib 


sidtv*nor 


sidev-ab 


p-score 


1 ^ 


0.7267 


0.0039 


•0.0020 


0.2047 


0.2826 


0.936508 


1 11 


0.112 


0.0005 


-0.0003 


0.1313 


0.0832 


0.983305 



* used in Ox algorithm 
5 total %VAR used - 5.927% 

A discriminant function was found using Fisher's 
discriminant analysis (FDA), assuming equal prior 
probabilities and variable cost of misclassif ication. 

10 This function used 5 of the PCs above which together 
account for 5.93% of the sample -to -sample variance 
(3,5,10,16,18). The discriminant score calculated for 
each sample is plotted in FIG. 5, where samples above 
are classified as normal, samples below Y,. abnormals. The 

15 Pap smear diagnosis is shown as well (o - normals, x - 
abnormals) . The false positive rate (FP) is 2/15 = 
13.4%, while the false negative rate (FN) is 12/29 = 
41.4%, where the Y^ is 0 (solid line). Another choice is 
Y^ is 0.6, where the FN is 4/29 = 13.8% and the FP is 5/15 

20 = 33.3% (brolcen line). A plot of the FN versus FP, 

obtained by varying Y^ in FIG. 5, is shown in FIG. 6, with 
the diagonal line representing random classification. 
The expected performance of the discriminant function on 
additional samples was estimated using cross validation, 

25 with the results shown in FIG. 6. 



S- ThinPren vb Pellet Autof luoreacenea 

FIG, 7 compares ThinPrep and pellet emission spectra 
from a typical sample for 280 nm excitation. For all 
samples, the spectra are similar in shape, except that 
the ThinPrep spectra are blue shifted by an average of 10 
nm. In FIG. 8, the spectra at 370 nm excitation are 
compared for a typical case. Again, the ThinPrep 
spectrum is blue-shifted with respect to the pellet 
spectrum. In addition, the intensity at the ThinPrep PN 
peak around (370,450) is 5- to 10-times lower than the 
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corresponding pellet spectra. The variance between 
samples is reduced for the ThinPrep spectra, having a 
lower standard deviation as a percentage of the mean, 
58%, compared to 82% for the pellet data. The excitation 
5 spectra of the ThinPrep slides recorded at 440 nm 
emission showed considerable variance below 350 nm 
excitation due to the strong UV emission of the glass 
slides. Above 350 nm excitation, the spectra differed 
from the pellet spectra in the degree of variance between 
10 samples and in the intensity, as described above for the 
(370,450) peak. 
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APPENDIX I: SPECIPICITY AMD SENSITIVITY 

Summarized from: Albert A,, Harris E.K. : Multivariate 
Interpretation of Clinical Laboratory Data, Marcel Dekker 
5 Inc., New York, pp. 75-82, (1987), the disclosure of 
which is expressly incorporated herein by reference. 

Assuming a group of T samples which can be 
categorized as normal (N samples) or diseased (D 

10 samples) . A diagnostic test, designed to determine 

whether the sample is normal or diseased, is applied to 
each sample. The results of the tests is the continuous 
variable x, which is then used to determine the sample 
type. FIG. 9 illustrates a hypothetical distribution of 

15 test values for each sample type. A diagnostic method 
based on this test can easily be defined by choosing a 
cutoff point, d, such that a sample with an observed 
value x<d is diagnosed as normal and a sample with an 
observed value x*d is diagnosed as abnormal . 

20 

Several quantitative measures have been defined to 
* evaluate' the performance of this type of method. The 
first type evaluates the test itself (i.e., measures the 
ability of the test to separate the two populations, N 

25 and D) . Sensitivity and specificity are two such 

measures. The second type is designed to aid in the 
interpretation of a particular test result (i.e. deciding 
whether the individual test measurement has come from a 
normal or diseased sample) . Positive and negative 

30 predictive value are two measures of this type. 

To define these measures, some terminology and 
notation must be introduced. Referring to Table 3, a 
sample to be tested can be either normal or diseased; the 
35 result of the test for each type of sample can be either 
negative or positive. True negatives represent those 
normal with a positive test result. In these cases, the 
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diagnosis based on the rest result is correct. False 
positives are those normal samples which have a positive 
test result and false negatives are those diseased 
samples which have a negative test result. In these 
5 cases, the diagnosis based on the test result is 
incorrect . 



TABLE 3 





Normal 




Total 
Sanplaa 


Teat 

Negative 

(x<d) 


True 
Negatives 
(TN) 


False 
Negatives 
(FN) 


Negatives 
(Neg) 


Teat 

Poaitive 
(xsd) 


False 
Positives 
(FP) 


True 
Positives 
(TP) 


Positives 
(Pos) 


Total 
Sanqplea 


N 


D 


T 



20 

With this terminology. Table 4 contains a definition 
of sensitivity and specificity, the two measures which 
assess the performance of the diagnostic method. 
Specificity is the proportion of normal samples with a 

25 negative test result (proportion of normal samples 

diagnosed correctly) . Sensitivity is the proportion of 
diseased samples with a positive test result (Proportion 
of diseased samples correctly diagnosed) . FIG. 9 also 
contains a graphical representation of specificity and 

3 0 sensitivity. Specificity represents the area under the 
normal sample distribution curve to the left of the cut 
off point while sensitivity represent the area under the 
diseased sample distribution curve to the right of the 
cut off point. 

35 
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TABLE 4 



TMt 

Mttasure 


Meaning 


Calculation 


Specificity 


Proportion of normal 

samples with 
negative test result 


Sp»TN/N 


Sensitivity 


Proportion of 
diseased samples 
with positive test 
result 


Se=TP/D 



While sensitivity and specificity characterize the 
performance of a particular method, another set of 
statistics is required to interpret the laboratory test 
result for a given specimen. The positive and negative 
predictive value quantify the meaning of an individual 
test result (Table 5) . The positive predictive value is 
the probability that if the test result is positive, the 
sample is diseased. The negative predictive value is the 
probability that if the test result is negative, the 
sample is normal. Positive and negative predictive value 
are calculated from Baye's rule as outlined in Albert and 
Harris. Table 5 contains two equivalent formulas for 
calculation positive and negative predictive value. 



TABLE 5 



25 



Measure 


Meaning 


Calculation 

1 


Calculation 
2 


Positive 
Predic- 
tive 
Value 


The probability 
that, if the 

test is 
positive, the 
sample is 
diseased 


PV,=TP/Pos 


PV,=DSe/ (DSe 
+N(l-Sp) ) 


Negative 
Predic- 
tive 
Value 


The probability 
that, if the 

test is 
negative, the 
sample is normal 


PV.=TN/Neg 


PV. 

=NSp/(NSp+D 
(1-Se) ) 
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APPENDIX ZI: PRINCIPAL COMPONENTS 



1 


ex(nm) 


eni(nin) 


evec3 


evec5 


eveclO 


eveclS 


evecIS 1 




250 


260 


0.155 


0 .353 


-0 .031 


-0.058 


-0.055 1 


5 


250 


290 


0.058 


0.097 


-0 . 020 


0 . 077 


0.109 1 




250 


300 


0.024 


0 .055 


-0 . 033 


0 . 035 


0 . 096 




250 


310 


0.028 


0.106 


-0.023 


0.064 


0.186 1 




250 


320 


0.037 


0.153 


0 .000 


0.119 


0.168 






250 


330 


0.016 


0.188 


-0.001 


0.052 


0.034 


10 


250 


340 


-0.004 


0.197 


-0.012 


-0.003 


-0.064 1 




250 


350 


-0.014 


0.187 


-0.046 


-0.030 


-0.140 




250 


360 


-0.015 


0.132 


-0.051 


-0.029 


-0.134 




250 


370 


-0.011 


0.060 


-0.040 


-0.010 


-0.110 




250 


380 


-0.004 


-0.017 


-0.013 


0.012 


-0 .039 


15 


250 


390 


-0.003 


-0.069 


0.019 


0-021 


-0 .010 




250 


400 


-0.005 


-0.088 


0.049 


-0 . 001 


0 . 010 




250 


410 


-0.010 


-0.082 


0 .051 


-0 . 009 


0 . 006 




250 


420 


-0.012 


-0 . 061 


0 . 046 


-0 . 008 


0.061 1 




250 


430 


-0 . 013 


-0 . 047 


0 . 044 


- 0 . 006 


A A C O II 

O.Ob? 


20 


250 


440 


-0 . 013 


-0 . 031 


0.036 


A A AO 
0 . 00 J 


A AC"! II 

0 . 0 b J 11 




250 






_ n noo 
~ U . ^ 




- 0 001 
yj • \j \j X. 


0 06 0 1 

W . V/ U V II 




250 


460 


-0.010 


-0.014 


0.034 


0 .006 


0.054 1 




260 


290 


-0.013 


0.017 


-0.063 


-0.046 


-0.101 




260 


300 


-0.030 


-0.026 


-0.102 


-0.134 


-0.044 


25 


260 


310 


-0 .009 


0.007 


-0.093 


-0.019 


0.085 




260 


320 


-0 . 003 


0.035 


-0. 078 


0.054 


0.160 
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1 ex(nm) 


em(nm) 


evec3 


evec5 


eveclO 


evecie 


eveclS 


1 260 


330 


-0.029 


0.060 


-0.091 


0.036 


0.052 


260 


340 


-0.045 


0.085 


-0.121 


-0.009 


-0.008 


260 


350 


-0.046 


0.087 


-0.143 


-0.031 


-0.088 


260 


360 


-0.037 


0.066 


-0.136 


-0.004 


-0.119 


1 260 




370 


-0.029 


0.042 


-0.115 


-0.005 


-0.094 


260 


380 


-0.023 


0.015 


-0.077 


-0.006 


-0.088 


260 


390 


-0.021 


-0.007 


-0.039 


-0.011 


-0.062 


260 


400 


-0.021 


-0.021 


-0.010 


-0.022 


-0.048 


260 


410 


-0.023 


-0.025 


0.010 


-0.030 


-0 . 039 


260 


420 


-0.023 


-0.023 


0.022 


-0.031 


-0.009 


260 


430 


-0.022 


-0.023 


0.031 


-0.027 


-0.024 


260 


440 


-0.021 


-0.020 


0.037 


-0.029 


-0.026 


260 


450 


-0.019 


-0.018 


0.046 


-0.028 


-0.026 


260 


460 


-0.016 


-0.015 


0.044 


-0.024 


-0.032 


260 


470 


-0.014 


-0.014 


0.044 


-0.023 


-0.027 


260 


480 


-0.012 


-0.011 


0.039 


-0.020 


-0.020 


270 


300 


-0.032 


-0.105 


-0.059 


0.017 


-0.173 


270 


310 


0. 008 


-0.068 


-0.036 


-0.005 


-0.040 


270 


320 


0.026 


-0.035 


-0.014 


0.006 


0.028 


270 


330 


0.000 


0.000 


0.000 


0.000 


0.000 


270 


340 


-0.020 


0.038 


-0.025 


0.011 


0.002 


270 


350 


-0.026 


0.046 


-0.058 


0.073 


-0.024 


270 


360 


-0.021 


0.035 


-0.062 


0.124 


-0.039 


270 


370 


-0 . 017 


0.017 


-0.058 


0.140 


-0.048 
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ex (nm) 


etn (nm) 


evec3 


evecS 


eveclO 


evecl6 


eveclS 


270 


380 


-0 . 016 


0 . 004 


-0.036 


0.115 


-0. 042 


270 


390 

W ^ v 


-0 020 


-0 . 005 


-0.009 


0.086 


-0 . 035 1 




dOO 


-0 02A 


-0 012 


0 Oil 


0 . 062 


-0.038 


^ t \j 


A1 O 

H X W 


w • U «9 U 


- 0 016 


0 . 027 


0 . 046 


-0 . 042 






-A A^ 1 
— U ♦ U J X 


• U • w X^ 


0 040 


0 035 


-0 oil 

w . V/ X X 


^ /u 


A n 


A A ^ T 


- A A 1 <^ 
" U . UXD 


0 om 

U • UwX 


0 035 


- 0 03fl 


^ /U 


A Af\ 
44 U 


A A'^ A 


-A A 1 1^ 
- U . UXD 


U . U D 7 


O OOB 
u • u ^ o 


_ n n A n 




A c n 


A AO •? 


A A1 C 
- U • UXD 


U • UD O 
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APPENDIX III: REFERENCES 

The disclosures of the following publications, 
patents and applications are expressly incorporated 
5 herein by reference, as are any of the other references 
mentioned above: 

A two-stage fluorescence diagnostic method is 
disclosed in detail in application Serial No, 08/060,432, 
10 filed May 12, 1993, and is assigned to the same assignee 
as the present invention. 

An application entitled "Optical Method And 
apparatus for the Diagnosis of Cervical Precancers using 
15 Raman and Fluorescence Spectroscopies" (Richards -Kortum 
et al.) was filed on March 14, 1995, and is assigned to 
the same assignee as the present invention, 

Lui et aJ., "Fluorescence and Time -Resolved Light 
20 Scattering as Optical Diagnostic Techniques to Separate 
Diseased and Normal Biomedical Media," J. Photochem. 
Photobiol. B: Biol. 16, 187-209, 1992. 

Wong et al., "Infrared Spectroscopy of Human 
25 Cervical Cells: Evidence of Extensive Structural Changes 
during Carcinogenesis", Proc. Nat'l Acad. Sci. USA, 88, 
10988-10992, 1991 . 

Kurman RJ, Henson DE, Herbst AL, Noller KL, Shiffman 
30 MH, "Interim guidelines for management of abnormal 
cervical cytology," JAMA, 271, 1866-1869, 1994. 

Davison JM, Marty JJ, "Detecting premalignant 
cervical lesions: Contributions of screening colposcopy 
35 to cytology," J Reproductive Medicine, 39, 388-392, 1994. 
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Banda-Gamboa, Hugo, et al, "Automation in cervical 
cytology: an overview," Anal Cell Path, 4 (1992) 25-48. 

Chance B, Thorell L, "Localization and kinetics of 
5 reduced PN in living cells by microf luorimetry, » J Biol 
Chem 234:3044, 1959. 



Caspersson TC, Cell Structure and Function, ch 4, 
Norton & Co, New York, 1950. 

Aubin JE, "Autof luorescence of Viable Cultured 
Mammalian Cells," J Histochem Cytochem, 27(1), 36-43, 
1979. 



Benson RC, et al, "Cellular Autof luorescence : Is It 
Due to Flavins?" J Histochem Cytochem, 27(1), 44-48, 
1979. 



Tamura M, ... Chance B, "In Vivo Study of Tissue 
20 Oxygen ...," Annu J?ev Physiol, 1989, 51:813-814. 

Hutchinson, ML, Agarwal, P, Denault, T, Berger, B, 
Cibas, ES, "A New Look at Cervical Cytology," Acta 
Cytologica, 36, 4:499-504, 1992. 

25 

Taylor DG, Demas JN, "Light intensity measurements 
I: Large area bolometers with uW sensitivities and 
absolute calibration of the rhodamine B quantum counter," 
Anai Chem, 51, 712-717, 1979. 

30 

Vooijs GP, "Benign proliferative reactions, 
intraepithelial neoplasia and invasive cancer of the 
uterine cervix, " in Bibbo M (ed) , Comprehensive 
Cytopathology, WB Saunders Company, pp 176-177, 1991. 
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Dillon WR and Goldstein M, "Multivariate Analysis 
Methods and Applications," Chapter 10. Wiley & Sons, New 
York (1985) 
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1. A method of detecting tissue abnormality in a tissue 
sample in vitro comprising: 

(i) providing a tissue sample; 

(ii) sequentially illuminating said tissue sample in 
vitro with a set of at least two 
electromagnetic radiation wavelengths selected 
to cause said tissue sample to produce a set of 
fluorescence intensity spectra indicative of 
tissue abnormality; 

(iii) detecting said set of fluorescence intensity 
spectra emitted from said tissue sample as a 
result of illumination with each of said 
wavelengths; and 

(iv) calculating from said set of fluorescence 
intensity spectra, a probability that said 
tissue sample is normal or abnormal. 



2. The method of claim i, wherein said calculating st 
comprises, conducting principal component analysis of 
said fluorescent spectra relative to a set of 
preprocessed spectra obtained from tissue samples of 
known pathology. 



3. The method of claim 2, wherein said principal 
component analysis does not include the highest and 
lowest order principal components. 

35 
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4. The method of claim 3, wherein said principal 
component analysis comprises a Fisher's determinant 
analysis. 

5 

5. The method of claim 1, wherein said calculating step 
comprises, normalizing said spectra relative to a maximum 
intensity within said spectra. 

10 

6. The method of claim 5, wherein said calculating step 
further comprises, mean-scaling said spectra as a 
function of a mean intensity of spectra. 

15 

7. The method of claim 1, wherein said providing step 
comprises obtaining said tissue sample by biopsy. 

20 8. The method of claim 7, wherein said providing step 

further comprises ethanol fixation of said tissue sample, 

9, The method of claim 8, wherein said providing step 
25 even further comprises generating a monolayer cell touch 

preparation or a pellet. 

10. The method of claim 1, wherein said set of at least 
3 0 two electromagnetic wavelengths are 250 nm, 550 nm and 

all wavelengths between 250 nm and 550 nm at 10 nm 
intervals . 

35 11. The method of claim 1, wherein said detecting step 
comprises, detecting an intensity of fluorescence at 250 
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nm, 700 nm and all wavelengths between 250 nm and 700 nm 
at 10 nm intervals. 



5 12. The method of claim 1, wherein said illuminating 
comprises, illuminating said sanple substantially normal 
to a surface of said sample, and wherein said detecting 
step comprises, detecting said spectra at an angle of 
approximately 20' from normal. 

10 

13. The method of claim 1, wherein at least 10 
sequential electromagnetic radiation wavelengths are used 
in said illuminating step to produce a set of at least 10 
15 different fluorescence intensity spectra in said 

detecting step, each spectra comprising fluorescence 
intensity at at least 11 wavelengths. 



20 14. The method of claim 13, wherein at least 30 

sequential electromagnetic radiation wavelengths are used 
in said illuminating step to produce a set of at least 30 
different fluorescence intensity spectra in said 
detecting step, each spectra comprising fluorescence 

25 intensity at at least 31 wavelengths. 



15. The method of claim 14, wherein at least 50 
sequential electromagnetic radiation wavelengths are used 
in said illuminating step to produce a set of at least 50 
different fluorescence intensity spectra in said 
detecting step, each said spectra comprising fluorescence 
intensity at at least 51 wavelengths. 
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